A Lemmatization Method for Modern Mongolian and its Application to Information Retrieval
نویسندگان
چکیده
In Modern Mongolian, a content word can be inflected when concatenated with suffixes. Identifying the original forms of content words is crucial for natural language processing and information retrieval. We propose a lemmatization method for Modern Mongolian and apply our method to indexing for information retrieval. We use technical abstracts to show the effectiveness of our method experimentally.
منابع مشابه
Enhancing Lemmatization for Mongolian and its Application to Statistical Machine Translation
Lemmatization is crucial in natural language processing and information retrieval especially for highly inflected languages, such as Finnish and Mongolian. The state-of-the-art method of lemmatization for Mongolian does not need a noun dictionary and is scalable, but errors of this method are mainly caused by problems related to part of speech (POS) information. To resolve this problem, we inte...
متن کاملResearch on Reasoning and Retrieval Methods Based on Mongolian Curriculum Areas of Semantic Web
The backwardness of the Mongolian network teaching resources results in its low reuse rates and utilization. For this situation, a retrieval method of semantic web based on Mongolian curriculum areas was set up. Firstly, the method established the Mongolian ontology of course ‘Artificial Intelligence ( )’in area of teaching, it uses a relationship database MySQL to record ontology information, ...
متن کاملPerformance Evaluation of Medical Image Retrieval Systems Based on a Systematic Review of the Current Literature
Background and Aim: Image, as a kind of information vehicle which can convey a large volume of information, is important especially in medicine field. Existence of different attributes of image features and various search algorithms in medical image retrieval systems and lack of an authority to evaluate the quality of retrieval systems, make a systematic review in medical image retrieval system...
متن کاملDesign and Realization of Mongolian Syntactic Retrieval System Based on Dependency Treebank
In the past seven years, Language Research Institute of Inner Mongolia University has constructed a 500,000word scale Mongolian dependency treebank. The syntactic treebank provides a favorable data platform for language research and information processing. In order to effectively use the treebank, we have designed and implemented a graphical syntactic information retrieval system based on the M...
متن کاملTesting and Validating the Role of Interactive Information Retrieval Model in Faculty Members' psychological Enabling: A Case Study of Alborz University of Medical Sciences
The term "electromagnetic fields" (EMF) is a combination of electric and magnetic fields as a diagnostic method as well as a therapeutic tool with many advantages such as ease of operation and painlessness, very controllable, which today has found wide application in regenerative medicine and also cancer treatment. In addition to organs such as nerves, hearts, and bones that have an electrica...
متن کامل